english pronunciation
Whisper based Cross-Lingual Phoneme Recognition between Vietnamese and English
Minh, Nguyen Huu Nhat, Anh, Tran Nguyen, Dung, Truong Dinh, Van Nam, Vo, Tuyen, Le Pham
Cross-lingual phoneme recognition has emerged as a significant challenge for accurate automatic speech recognition (ASR) when mixing Vietnamese and English pronunciations. Unlike many languages, Vietnamese relies on tonal variations to distinguish word meanings, whereas English features stress patterns and non-standard pronunciations that hinder phoneme alignment between the two languages. To address this challenge, we propose a novel bilingual speech recognition approach with two primary contributions: (1) constructing a representative bilingual phoneme set that bridges the differences between Vietnamese and English phonetic systems; (2) designing an end-to-end system that leverages the PhoWhisper pre-trained encoder for deep high-level representations to improve phoneme recognition. Our extensive experiments demonstrate that the proposed approach not only improves recognition accuracy in bilingual speech recognition for Vietnamese but also provides a robust framework for addressing the complexities of tonal and stress-based phoneme recognition.
- North America > United States (0.04)
- Europe > France (0.04)
- Asia > Vietnam > Hồ Chí Minh City > Hồ Chí Minh City (0.04)
- Asia > Vietnam > Da Nang > Da Nang (0.04)
NUMCoT: Numerals and Units of Measurement in Chain-of-Thought Reasoning using Large Language Models
Xu, Ancheng, Tan, Minghuan, Wang, Lei, Yang, Min, Xu, Ruifeng
Numeral systems and units of measurement are two conjoined topics in activities of human beings and have mutual effects with the languages expressing them. Currently, the evaluation of Large Language Models (LLMs) often involves mathematical reasoning, yet little attention is given to how minor changes in numbers or units can drastically alter the complexity of problems and the performance of LLMs. In this paper, we scrutinize existing LLMs on processing of numerals and units of measurement by constructing datasets with perturbations. We first anatomize the reasoning of math word problems to different sub-procedures like numeral conversions from language to numbers and measurement conversions based on units. Then we further annotate math word problems from ancient Chinese arithmetic works which are challenging in numerals and units of measurement. Experiments on perturbed datasets demonstrate that LLMs still encounter difficulties in handling numeral and measurement conversions.
- North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
- Asia > China > Guangdong Province > Shenzhen (0.04)
- Europe > Italy > Tuscany > Florence (0.04)
- (8 more...)
Eida: English Coach - Apps on Google Play
Eida is an English language coach that helps you practise English grammar, English vocabulary, English pronunciation, English listening, English writing, reading in order for you to improve your English vocabulary, your English grammar, your English pronunciation and speak English fluently. Chatbot to learn English: it a language assistant that acts like an English chatbot to help you study English and speak English perfectly and fluently. With Eida English Coach, you have the opportunity to listen to native voices and speak exactly like them. English listening and speaking: Eida English coach gives you the possibility to tap and listen to a native speaker and to record your own voice and test it to know if it is right or wrong. Learn English grammar: Eida presents various grammatical rules on conjugations of verbs, English phrasal verbs, English pronouns, English conjunctions, English prepositions and many other English grammatical rules to improve your English grammar.